Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 418 |
| Missing cells | 845 |
| Missing cells (%) | 11.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 276.7 KiB |
| Average record size in memory | 677.9 B |
Variable types
| Numeric | 6 |
|---|---|
| Unsupported | 1 |
| Categorical | 8 |
| Text | 3 |
DatasetName has constant value "" | Constant |
Survived has 418 (100.0%) missing values | Missing |
Age has 86 (20.6%) missing values | Missing |
Cabin has 327 (78.2%) missing values | Missing |
CabinPrefix has 13 (3.1%) missing values | Missing |
PassengerId is uniformly distributed | Uniform |
PassengerId has unique values | Unique |
Name has unique values | Unique |
Survived is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
SibSp has 283 (67.7%) zeros | Zeros |
Parch has 324 (77.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-23 19:04:22.750871 |
|---|---|
| Analysis finished | 2024-03-23 19:04:28.336937 |
| Duration | 5.59 seconds |
| Software version | ydata-profiling vv4.6.5 |
| Download configuration | config.json |
PassengerId
Real number (ℝ)
UNIFORM  UNIQUE 
| Distinct | 418 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1100.5 |
| Minimum | 892 |
|---|---|
| Maximum | 1309 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 892 |
|---|---|
| 5-th percentile | 912.85 |
| Q1 | 996.25 |
| median | 1100.5 |
| Q3 | 1204.75 |
| 95-th percentile | 1288.15 |
| Maximum | 1309 |
| Range | 417 |
| Interquartile range (IQR) | 208.5 |
Descriptive statistics
| Standard deviation | 120.81046 |
|---|---|
| Coefficient of variation (CV) | 0.10977779 |
| Kurtosis | -1.2 |
| Mean | 1100.5 |
| Median Absolute Deviation (MAD) | 104.5 |
| Skewness | 0 |
| Sum | 460009 |
| Variance | 14595.167 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 892 | 1 | 0.2% |
| 1205 | 1 | 0.2% |
| 1177 | 1 | 0.2% |
| 1176 | 1 | 0.2% |
| 1175 | 1 | 0.2% |
| 1174 | 1 | 0.2% |
| 1173 | 1 | 0.2% |
| 1172 | 1 | 0.2% |
| 1171 | 1 | 0.2% |
| 1170 | 1 | 0.2% |
| Other values (408) | 408 |
| Value | Count | Frequency (%) |
| 892 | 1 | |
| 893 | 1 | |
| 894 | 1 | |
| 895 | 1 | |
| 896 | 1 | |
| 897 | 1 | |
| 898 | 1 | |
| 899 | 1 | |
| 900 | 1 | |
| 901 | 1 |
| Value | Count | Frequency (%) |
| 1309 | 1 | |
| 1308 | 1 | |
| 1307 | 1 | |
| 1306 | 1 | |
| 1305 | 1 | |
| 1304 | 1 | |
| 1303 | 1 | |
| 1302 | 1 | |
| 1301 | 1 | |
| 1300 | 1 |
Survived
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 418 |
|---|---|
| Missing (%) | 100.0% |
| Memory size | 3.4 KiB |
Pclass
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.8 KiB |
| 3 | |
|---|---|
| 1 | |
| 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 418 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 418 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 418 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 418 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 218 | |
| 1 | 107 | |
| 2 | 93 |
Name
Text
UNIQUE 
| Distinct | 418 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 34.6 KiB |
Length
| Max length | 63 |
|---|---|
| Median length | 51 |
| Mean length | 27.483254 |
| Min length | 13 |
Characters and Unicode
| Total characters | 11488 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 418 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Kelly, Mr. James |
|---|---|
| 2nd row | Wilkes, Mrs. James (Ellen Needs) |
| 3rd row | Myles, Mr. Thomas Francis |
| 4th row | Wirz, Mr. Albert |
| 5th row | Hirvonen, Mrs. Alexander (Helga E Lindqvist) |
| Value | Count | Frequency (%) |
| mr | 242 | 14.0% |
| miss | 78 | 4.5% |
| mrs | 72 | 4.2% |
| john | 28 | 1.6% |
| william | 23 | 1.3% |
| master | 21 | 1.2% |
| charles | 16 | 0.9% |
| joseph | 15 | 0.9% |
| james | 14 | 0.8% |
| henry | 14 | 0.8% |
| Other values (825) | 1202 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7395 | |
| Uppercase Letter | 1738 | 15.1% |
| Space Separator | 1309 | 11.4% |
| Other Punctuation | 884 | 7.7% |
| Open Punctuation | 78 | 0.7% |
| Close Punctuation | 78 | 0.7% |
| Dash Punctuation | 6 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 971 | |
| e | 822 | |
| a | 786 | |
| s | 628 | |
| i | 621 | |
| n | 596 | |
| l | 526 | 7.1% |
| o | 467 | 6.3% |
| t | 303 | 4.1% |
| h | 257 | 3.5% |
| Other values (16) | 1418 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 515 | |
| J | 112 | 6.4% |
| A | 103 | 5.9% |
| C | 101 | 5.8% |
| E | 95 | 5.5% |
| S | 81 | 4.7% |
| H | 80 | 4.6% |
| W | 76 | 4.4% |
| B | 69 | 4.0% |
| L | 61 | 3.5% |
| Other values (14) | 445 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 418 | |
| , | 418 | |
| " | 44 | 5.0% |
| ' | 4 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 1309 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 78 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 78 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9133 | |
| Common | 2355 | 20.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 971 | 10.6% |
| e | 822 | 9.0% |
| a | 786 | 8.6% |
| s | 628 | 6.9% |
| i | 621 | 6.8% |
| n | 596 | 6.5% |
| l | 526 | 5.8% |
| M | 515 | 5.6% |
| o | 467 | 5.1% |
| t | 303 | 3.3% |
| Other values (40) | 2898 |
Common
| Value | Count | Frequency (%) |
| 1309 | ||
| . | 418 | 17.7% |
| , | 418 | 17.7% |
| ( | 78 | 3.3% |
| ) | 78 | 3.3% |
| " | 44 | 1.9% |
| - | 6 | 0.3% |
| ' | 4 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11488 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1309 | 11.4% | |
| r | 971 | 8.5% |
| e | 822 | 7.2% |
| a | 786 | 6.8% |
| s | 628 | 5.5% |
| i | 621 | 5.4% |
| n | 596 | 5.2% |
| l | 526 | 4.6% |
| M | 515 | 4.5% |
| o | 467 | 4.1% |
| Other values (48) | 4247 |
Sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.3 KiB |
| male | |
|---|---|
| female |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.7272727 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1976 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | male |
| 4th row | male |
| 5th row | female |
Common Values
| Value | Count | Frequency (%) |
| male | 266 | |
| female | 152 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 266 | |
| female | 152 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1976 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1976 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 570 | |
| m | 418 | |
| a | 418 | |
| l | 418 | |
| f | 152 | 7.7% |
Age
Real number (ℝ)
MISSING 
| Distinct | 79 |
|---|---|
| Distinct (%) | 23.8% |
| Missing | 86 |
| Missing (%) | 20.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.27259 |
| Minimum | 0.17 |
|---|---|
| Maximum | 76 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0.17 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 21 |
| median | 27 |
| Q3 | 39 |
| 95-th percentile | 57 |
| Maximum | 76 |
| Range | 75.83 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.181209 |
|---|---|
| Coefficient of variation (CV) | 0.46845047 |
| Kurtosis | 0.083783352 |
| Mean | 30.27259 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.45736129 |
| Sum | 10050.5 |
| Variance | 201.1067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 17 | 4.1% |
| 21 | 17 | 4.1% |
| 22 | 16 | 3.8% |
| 30 | 15 | 3.6% |
| 18 | 13 | 3.1% |
| 27 | 12 | 2.9% |
| 26 | 12 | 2.9% |
| 23 | 11 | 2.6% |
| 25 | 11 | 2.6% |
| 29 | 10 | 2.4% |
| Other values (69) | 198 | |
| (Missing) | 86 |
| Value | Count | Frequency (%) |
| 0.17 | 1 | 0.2% |
| 0.33 | 1 | 0.2% |
| 0.75 | 1 | 0.2% |
| 0.83 | 1 | 0.2% |
| 0.92 | 1 | 0.2% |
| 1 | 3 | |
| 2 | 2 | |
| 3 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 6 | 3 |
| Value | Count | Frequency (%) |
| 76 | 1 | 0.2% |
| 67 | 1 | 0.2% |
| 64 | 3 | |
| 63 | 2 | |
| 62 | 1 | 0.2% |
| 61 | 2 | |
| 60.5 | 1 | 0.2% |
| 60 | 3 | |
| 59 | 1 | 0.2% |
| 58 | 1 | 0.2% |
SibSp
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.44736842 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 283 |
| Zeros (%) | 67.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.89675956 |
|---|---|
| Coefficient of variation (CV) | 2.0045214 |
| Kurtosis | 26.498712 |
| Mean | 0.44736842 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.1683366 |
| Sum | 187 |
| Variance | 0.80417771 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 110 | 26.3% |
| 2 | 14 | 3.3% |
| 3 | 4 | 1.0% |
| 4 | 4 | 1.0% |
| 8 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 283 | |
| 1 | 110 | 26.3% |
| 2 | 14 | 3.3% |
| 3 | 4 | 1.0% |
| 4 | 4 | 1.0% |
| 5 | 1 | 0.2% |
| 8 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 8 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| 4 | 4 | 1.0% |
| 3 | 4 | 1.0% |
| 2 | 14 | 3.3% |
| 1 | 110 | 26.3% |
| 0 | 283 |
Parch
Real number (ℝ)
ZEROS 
| Distinct | 8 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3923445 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 324 |
| Zeros (%) | 77.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.98142888 |
|---|---|
| Coefficient of variation (CV) | 2.5014468 |
| Kurtosis | 31.412513 |
| Mean | 0.3923445 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.6544617 |
| Sum | 164 |
| Variance | 0.96320264 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 324 | |
| 1 | 52 | 12.4% |
| 2 | 33 | 7.9% |
| 3 | 3 | 0.7% |
| 4 | 2 | 0.5% |
| 9 | 2 | 0.5% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 324 | |
| 1 | 52 | 12.4% |
| 2 | 33 | 7.9% |
| 3 | 3 | 0.7% |
| 4 | 2 | 0.5% |
| 5 | 1 | 0.2% |
| 6 | 1 | 0.2% |
| 9 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.5% |
| 6 | 1 | 0.2% |
| 5 | 1 | 0.2% |
| 4 | 2 | 0.5% |
| 3 | 3 | 0.7% |
| 2 | 33 | 7.9% |
| 1 | 52 | 12.4% |
| 0 | 324 |
Ticket
Text
| Distinct | 363 |
|---|---|
| Distinct (%) | 86.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 6.8755981 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2874 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 321 ? |
|---|---|
| Unique (%) | 76.8% |
Sample
| 1st row | 330911 |
|---|---|
| 2nd row | 363272 |
| 3rd row | 240276 |
| 4th row | 315154 |
| 5th row | 3101298 |
| Value | Count | Frequency (%) |
| pc | 32 | 5.9% |
| c.a | 19 | 3.5% |
| ca | 8 | 1.5% |
| soton/o.q | 8 | 1.5% |
| sc/paris | 7 | 1.3% |
| 17608 | 5 | 0.9% |
| 2 | 5 | 0.9% |
| a/5 | 5 | 0.9% |
| w./c | 5 | 0.9% |
| f.c.c | 4 | 0.7% |
| Other values (383) | 445 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2224 | |
| Uppercase Letter | 349 | 12.1% |
| Other Punctuation | 172 | 6.0% |
| Space Separator | 125 | 4.3% |
| Lowercase Letter | 4 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 92 | |
| P | 52 | |
| A | 51 | |
| O | 44 | |
| S | 40 | |
| T | 14 | 4.0% |
| N | 14 | 4.0% |
| Q | 12 | 3.4% |
| R | 7 | 2.0% |
| I | 7 | 2.0% |
| Other values (5) | 16 | 4.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | |
| 6 | 206 | |
| 0 | 204 | |
| 5 | 195 | |
| 4 | 188 | |
| 8 | 144 | 6.5% |
| 9 | 137 | 6.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1 | |
| r | 1 | |
| i | 1 | |
| s | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 126 | |
| / | 46 | 26.7% |
Space Separator
| Value | Count | Frequency (%) |
| 125 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2521 | |
| Latin | 353 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 92 | |
| P | 52 | |
| A | 51 | |
| O | 44 | |
| S | 40 | |
| T | 14 | 4.0% |
| N | 14 | 4.0% |
| Q | 12 | 3.4% |
| R | 7 | 2.0% |
| I | 7 | 2.0% |
| Other values (9) | 20 | 5.7% |
Common
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | |
| 6 | 206 | |
| 0 | 204 | |
| 5 | 195 | |
| 4 | 188 | |
| 8 | 144 | 5.7% |
| 9 | 137 | 5.4% |
| Other values (3) | 297 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2874 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 364 | |
| 1 | 311 | |
| 2 | 268 | |
| 7 | 207 | 7.2% |
| 6 | 206 | 7.2% |
| 0 | 204 | 7.1% |
| 5 | 195 | 6.8% |
| 4 | 188 | 6.5% |
| 8 | 144 | 5.0% |
| 9 | 137 | 4.8% |
| Other values (22) | 650 |
Fare
Real number (ℝ)
| Distinct | 169 |
|---|---|
| Distinct (%) | 40.5% |
| Missing | 1 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.627188 |
| Minimum | 0 |
|---|---|
| Maximum | 512.3292 |
| Zeros | 2 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.2292 |
| Q1 | 7.8958 |
| median | 14.4542 |
| Q3 | 31.5 |
| 95-th percentile | 151.55 |
| Maximum | 512.3292 |
| Range | 512.3292 |
| Interquartile range (IQR) | 23.6042 |
Descriptive statistics
| Standard deviation | 55.907576 |
|---|---|
| Coefficient of variation (CV) | 1.5692391 |
| Kurtosis | 17.921595 |
| Mean | 35.627188 |
| Median Absolute Deviation (MAD) | 6.825 |
| Skewness | 3.6872133 |
| Sum | 14856.538 |
| Variance | 3125.6571 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7.75 | 21 | 5.0% |
| 26 | 19 | 4.5% |
| 8.05 | 17 | 4.1% |
| 13 | 17 | 4.1% |
| 10.5 | 11 | 2.6% |
| 7.8958 | 11 | 2.6% |
| 7.775 | 10 | 2.4% |
| 7.2292 | 9 | 2.2% |
| 7.225 | 9 | 2.2% |
| 7.8542 | 8 | 1.9% |
| Other values (159) | 285 |
| Value | Count | Frequency (%) |
| 0 | 2 | 0.5% |
| 3.1708 | 1 | 0.2% |
| 6.4375 | 2 | 0.5% |
| 6.4958 | 1 | 0.2% |
| 6.95 | 1 | 0.2% |
| 7 | 2 | 0.5% |
| 7.05 | 2 | 0.5% |
| 7.225 | 9 | |
| 7.2292 | 9 | |
| 7.25 | 5 |
| Value | Count | Frequency (%) |
| 512.3292 | 1 | 0.2% |
| 263 | 2 | 0.5% |
| 262.375 | 5 | |
| 247.5208 | 1 | 0.2% |
| 227.525 | 1 | 0.2% |
| 221.7792 | 3 | |
| 211.5 | 4 | |
| 211.3375 | 1 | 0.2% |
| 164.8667 | 2 | 0.5% |
| 151.55 | 2 | 0.5% |
Cabin
Text
MISSING 
| Distinct | 76 |
|---|---|
| Distinct (%) | 83.5% |
| Missing | 327 |
| Missing (%) | 78.2% |
| Memory size | 15.8 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 4.0769231 |
| Min length | 1 |
Characters and Unicode
| Total characters | 371 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 62 ? |
|---|---|
| Unique (%) | 68.1% |
Sample
| 1st row | B45 |
|---|---|
| 2nd row | E31 |
| 3rd row | B57 B59 B63 B66 |
| 4th row | B36 |
| 5th row | A21 |
| Value | Count | Frequency (%) |
| f | 4 | 3.4% |
| b57 | 3 | 2.5% |
| b63 | 3 | 2.5% |
| b66 | 3 | 2.5% |
| b59 | 3 | 2.5% |
| c27 | 2 | 1.7% |
| e46 | 2 | 1.7% |
| c6 | 2 | 1.7% |
| c78 | 2 | 1.7% |
| b45 | 2 | 1.7% |
| Other values (80) | 92 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 226 | |
| Uppercase Letter | 118 | |
| Space Separator | 27 | 7.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 34 | |
| 1 | 33 | |
| 6 | 30 | |
| 3 | 28 | |
| 2 | 25 | |
| 4 | 21 | |
| 7 | 15 | |
| 8 | 14 | |
| 0 | 14 | |
| 9 | 12 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 43 | |
| B | 32 | |
| D | 14 | 11.9% |
| E | 12 | 10.2% |
| F | 8 | 6.8% |
| A | 7 | 5.9% |
| G | 2 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| 27 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 253 | |
| Latin | 118 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 34 | |
| 1 | 33 | |
| 6 | 30 | |
| 3 | 28 | |
| 27 | ||
| 2 | 25 | |
| 4 | 21 | |
| 7 | 15 | |
| 8 | 14 | |
| 0 | 14 |
Latin
| Value | Count | Frequency (%) |
| C | 43 | |
| B | 32 | |
| D | 14 | 11.9% |
| E | 12 | 10.2% |
| F | 8 | 6.8% |
| A | 7 | 5.9% |
| G | 2 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 371 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 43 | |
| 5 | 34 | |
| 1 | 33 | 8.9% |
| B | 32 | 8.6% |
| 6 | 30 | 8.1% |
| 3 | 28 | 7.5% |
| 27 | 7.3% | |
| 2 | 25 | 6.7% |
| 4 | 21 | 5.7% |
| 7 | 15 | 4.0% |
| Other values (8) | 83 |
Embarked
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.8 KiB |
| S | |
|---|---|
| C | |
| Q |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 418 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Q |
|---|---|
| 2nd row | S |
| 3rd row | Q |
| 4th row | S |
| 5th row | S |
Common Values
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 270 | |
| c | 102 | 24.4% |
| q | 46 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 418 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 418 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 418 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 270 | |
| C | 102 | 24.4% |
| Q | 46 | 11.0% |
DatasetName
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 25.0 KiB |
| test |
|---|
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1672 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | test |
|---|---|
| 2nd row | test |
| 3rd row | test |
| 4th row | test |
| 5th row | test |
Common Values
| Value | Count | Frequency (%) |
| test | 418 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| test | 418 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 836 | |
| e | 418 | |
| s | 418 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1672 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 836 | |
| e | 418 | |
| s | 418 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1672 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 836 | |
| e | 418 | |
| s | 418 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1672 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 836 | |
| e | 418 | |
| s | 418 |
Title
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 24.5 KiB |
| Mr | |
|---|---|
| Miss | |
| Mrs | |
| Master | 21 |
| Other | 2 |
Length
| Max length | 6 |
|---|---|
| Median length | 2 |
| Mean length | 2.7703349 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1158 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mr |
|---|---|
| 2nd row | Mrs |
| 3rd row | Mr |
| 4th row | Mr |
| 5th row | Mrs |
Common Values
| Value | Count | Frequency (%) |
| Mr | 243 | |
| Miss | 80 | 19.1% |
| Mrs | 72 | 17.2% |
| Master | 21 | 5.0% |
| Other | 2 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| mr | 243 | |
| miss | 80 | 19.1% |
| mrs | 72 | 17.2% |
| master | 21 | 5.0% |
| other | 2 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 416 | |
| r | 338 | |
| s | 253 | |
| i | 80 | 6.9% |
| t | 23 | 2.0% |
| e | 23 | 2.0% |
| a | 21 | 1.8% |
| O | 2 | 0.2% |
| h | 2 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 740 | |
| Uppercase Letter | 418 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 338 | |
| s | 253 | |
| i | 80 | 10.8% |
| t | 23 | 3.1% |
| e | 23 | 3.1% |
| a | 21 | 2.8% |
| h | 2 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 416 | |
| O | 2 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1158 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 416 | |
| r | 338 | |
| s | 253 | |
| i | 80 | 6.9% |
| t | 23 | 2.0% |
| e | 23 | 2.0% |
| a | 21 | 1.8% |
| O | 2 | 0.2% |
| h | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 416 | |
| r | 338 | |
| s | 253 | |
| i | 80 | 6.9% |
| t | 23 | 2.0% |
| e | 23 | 2.0% |
| a | 21 | 1.8% |
| O | 2 | 0.2% |
| h | 2 | 0.2% |
CabinPrefix
Categorical
MISSING 
| Distinct | 7 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 13 |
| Missing (%) | 3.1% |
| Memory size | 23.8 KiB |
| F | |
|---|---|
| E | |
| C | |
| G | |
| B | 19 |
| Other values (2) | 20 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 405 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | E |
| 5th row | E |
Common Values
| Value | Count | Frequency (%) |
| F | 203 | |
| E | 63 | 15.1% |
| C | 61 | 14.6% |
| G | 39 | 9.3% |
| B | 19 | 4.5% |
| D | 13 | 3.1% |
| A | 7 | 1.7% |
| (Missing) | 13 | 3.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 203 | |
| e | 63 | 15.6% |
| c | 61 | 15.1% |
| g | 39 | 9.6% |
| b | 19 | 4.7% |
| d | 13 | 3.2% |
| a | 7 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 203 | |
| E | 63 | 15.6% |
| C | 61 | 15.1% |
| G | 39 | 9.6% |
| B | 19 | 4.7% |
| D | 13 | 3.2% |
| A | 7 | 1.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 405 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 203 | |
| E | 63 | 15.6% |
| C | 61 | 15.1% |
| G | 39 | 9.6% |
| B | 19 | 4.7% |
| D | 13 | 3.2% |
| A | 7 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 405 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 203 | |
| E | 63 | 15.6% |
| C | 61 | 15.1% |
| G | 39 | 9.6% |
| B | 19 | 4.7% |
| D | 13 | 3.2% |
| A | 7 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| F | 203 | |
| E | 63 | 15.6% |
| C | 61 | 15.1% |
| G | 39 | 9.6% |
| B | 19 | 4.7% |
| D | 13 | 3.2% |
| A | 7 | 1.7% |
FamilySize
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8397129 |
| Minimum | 1 |
|---|---|
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.519072 |
|---|---|
| Coefficient of variation (CV) | 0.82571144 |
| Kurtosis | 13.431226 |
| Mean | 1.8397129 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.1685425 |
| Sum | 769 |
| Variance | 2.3075798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 253 | |
| 2 | 74 | 17.7% |
| 3 | 57 | 13.6% |
| 4 | 14 | 3.3% |
| 5 | 7 | 1.7% |
| 7 | 4 | 1.0% |
| 11 | 4 | 1.0% |
| 6 | 3 | 0.7% |
| 8 | 2 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 253 | |
| 2 | 74 | 17.7% |
| 3 | 57 | 13.6% |
| 4 | 14 | 3.3% |
| 5 | 7 | 1.7% |
| 6 | 3 | 0.7% |
| 7 | 4 | 1.0% |
| 8 | 2 | 0.5% |
| 11 | 4 | 1.0% |
| Value | Count | Frequency (%) |
| 11 | 4 | 1.0% |
| 8 | 2 | 0.5% |
| 7 | 4 | 1.0% |
| 6 | 3 | 0.7% |
| 5 | 7 | 1.7% |
| 4 | 14 | 3.3% |
| 3 | 57 | 13.6% |
| 2 | 74 | 17.7% |
| 1 | 253 |
TicketAppearances
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.6 KiB |
| Single | |
|---|---|
| Small Group | |
| Big Group | 20 |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 7.8779904 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3293 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Single |
|---|---|
| 2nd row | Small Group |
| 3rd row | Single |
| 4th row | Single |
| 5th row | Small Group |
Common Values
| Value | Count | Frequency (%) |
| Single | 253 | |
| Small Group | 145 | |
| Big Group | 20 | 4.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| single | 253 | |
| group | 165 | |
| small | 145 | |
| big | 20 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 543 | |
| S | 398 | |
| i | 273 | |
| g | 273 | |
| n | 253 | 7.7% |
| e | 253 | 7.7% |
| 165 | 5.0% | |
| G | 165 | 5.0% |
| r | 165 | 5.0% |
| o | 165 | 5.0% |
| Other values (5) | 640 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2545 | |
| Uppercase Letter | 583 | 17.7% |
| Space Separator | 165 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 543 | |
| i | 273 | |
| g | 273 | |
| n | 253 | |
| e | 253 | |
| r | 165 | 6.5% |
| o | 165 | 6.5% |
| u | 165 | 6.5% |
| p | 165 | 6.5% |
| m | 145 | 5.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 398 | |
| G | 165 | |
| B | 20 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| 165 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3128 | |
| Common | 165 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 543 | |
| S | 398 | |
| i | 273 | |
| g | 273 | |
| n | 253 | |
| e | 253 | |
| G | 165 | 5.3% |
| r | 165 | 5.3% |
| o | 165 | 5.3% |
| u | 165 | 5.3% |
| Other values (4) | 475 |
Common
| Value | Count | Frequency (%) |
| 165 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3293 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 543 | |
| S | 398 | |
| i | 273 | |
| g | 273 | |
| n | 253 | 7.7% |
| e | 253 | 7.7% |
| 165 | 5.0% | |
| G | 165 | 5.0% |
| r | 165 | 5.0% |
| o | 165 | 5.0% |
| Other values (5) | 640 |
IsAlone
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.4 KiB |
| Alone | |
|---|---|
| With Family |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 7.3684211 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3080 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alone |
|---|---|
| 2nd row | With Family |
| 3rd row | Alone |
| 4th row | Alone |
| 5th row | With Family |
Common Values
| Value | Count | Frequency (%) |
| Alone | 253 | |
| With Family | 165 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| alone | 253 | |
| with | 165 | |
| family | 165 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 418 | |
| i | 330 | |
| A | 253 | 8.2% |
| o | 253 | 8.2% |
| n | 253 | 8.2% |
| e | 253 | 8.2% |
| W | 165 | 5.4% |
| t | 165 | 5.4% |
| h | 165 | 5.4% |
| 165 | 5.4% | |
| Other values (4) | 660 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2332 | |
| Uppercase Letter | 583 | 18.9% |
| Space Separator | 165 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 418 | |
| i | 330 | |
| o | 253 | |
| n | 253 | |
| e | 253 | |
| t | 165 | 7.1% |
| h | 165 | 7.1% |
| a | 165 | 7.1% |
| m | 165 | 7.1% |
| y | 165 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 253 | |
| W | 165 | |
| F | 165 |
Space Separator
| Value | Count | Frequency (%) |
| 165 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2915 | |
| Common | 165 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 418 | |
| i | 330 | |
| A | 253 | |
| o | 253 | |
| n | 253 | |
| e | 253 | |
| W | 165 | 5.7% |
| t | 165 | 5.7% |
| h | 165 | 5.7% |
| F | 165 | 5.7% |
| Other values (3) | 495 |
Common
| Value | Count | Frequency (%) |
| 165 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3080 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 418 | |
| i | 330 | |
| A | 253 | 8.2% |
| o | 253 | 8.2% |
| n | 253 | 8.2% |
| e | 253 | 8.2% |
| W | 165 | 5.4% |
| t | 165 | 5.4% |
| h | 165 | 5.4% |
| 165 | 5.4% | |
| Other values (4) | 660 |
| PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | DatasetName | Title | CabinPrefix | FamilySize | TicketAppearances | IsAlone | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 892 | NaN | 3 | Kelly, Mr. James | male | 34.5 | 0 | 0 | 330911 | 7.8292 | NaN | Q | test | Mr | F | 1 | Single | Alone |
| 1 | 893 | NaN | 3 | Wilkes, Mrs. James (Ellen Needs) | female | 47.0 | 1 | 0 | 363272 | 7.0000 | NaN | S | test | Mrs | F | 2 | Small Group | With Family |
| 2 | 894 | NaN | 2 | Myles, Mr. Thomas Francis | male | 62.0 | 0 | 0 | 240276 | 9.6875 | NaN | Q | test | Mr | F | 1 | Single | Alone |
| 3 | 895 | NaN | 3 | Wirz, Mr. Albert | male | 27.0 | 0 | 0 | 315154 | 8.6625 | NaN | S | test | Mr | E | 1 | Single | Alone |
| 4 | 896 | NaN | 3 | Hirvonen, Mrs. Alexander (Helga E Lindqvist) | female | 22.0 | 1 | 1 | 3101298 | 12.2875 | NaN | S | test | Mrs | E | 3 | Small Group | With Family |
| 5 | 897 | NaN | 3 | Svensson, Mr. Johan Cervin | male | 14.0 | 0 | 0 | 7538 | 9.2250 | NaN | S | test | Mr | E | 1 | Single | Alone |
| 6 | 898 | NaN | 3 | Connolly, Miss. Kate | female | 30.0 | 0 | 0 | 330972 | 7.6292 | NaN | Q | test | Miss | F | 1 | Single | Alone |
| 7 | 899 | NaN | 2 | Caldwell, Mr. Albert Francis | male | 26.0 | 1 | 1 | 248738 | 29.0000 | NaN | S | test | Mr | F | 3 | Small Group | With Family |
| 8 | 900 | NaN | 3 | Abrahim, Mrs. Joseph (Sophie Halaut Easu) | female | 18.0 | 0 | 0 | 2657 | 7.2292 | NaN | C | test | Mrs | F | 1 | Single | Alone |
| 9 | 901 | NaN | 3 | Davies, Mr. John Samuel | male | 21.0 | 2 | 0 | A/4 48871 | 24.1500 | NaN | S | test | Mr | G | 3 | Small Group | With Family |
| PassengerId | Survived | Pclass | Name | Sex | Age | SibSp | Parch | Ticket | Fare | Cabin | Embarked | DatasetName | Title | CabinPrefix | FamilySize | TicketAppearances | IsAlone | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 408 | 1300 | NaN | 3 | Riordan, Miss. Johanna Hannah"" | female | NaN | 0 | 0 | 334915 | 7.7208 | NaN | Q | test | Miss | F | 1 | Single | Alone |
| 409 | 1301 | NaN | 3 | Peacock, Miss. Treasteall | female | 3.0 | 1 | 1 | SOTON/O.Q. 3101315 | 13.7750 | NaN | S | test | Miss | E | 3 | Small Group | With Family |
| 410 | 1302 | NaN | 3 | Naughton, Miss. Hannah | female | NaN | 0 | 0 | 365237 | 7.7500 | NaN | Q | test | Miss | F | 1 | Single | Alone |
| 411 | 1303 | NaN | 1 | Minahan, Mrs. William Edward (Lillian E Thorpe) | female | 37.0 | 1 | 0 | 19928 | 90.0000 | C78 | Q | test | Mrs | C | 2 | Small Group | With Family |
| 412 | 1304 | NaN | 3 | Henriksson, Miss. Jenny Lovisa | female | 28.0 | 0 | 0 | 347086 | 7.7750 | NaN | S | test | Miss | F | 1 | Single | Alone |
| 413 | 1305 | NaN | 3 | Spector, Mr. Woolf | male | NaN | 0 | 0 | A.5. 3236 | 8.0500 | NaN | S | test | Mr | E | 1 | Single | Alone |
| 414 | 1306 | NaN | 1 | Oliva y Ocana, Dona. Fermina | female | 39.0 | 0 | 0 | PC 17758 | 108.9000 | C105 | C | test | Miss | C | 1 | Single | Alone |
| 415 | 1307 | NaN | 3 | Saether, Mr. Simon Sivertsen | male | 38.5 | 0 | 0 | SOTON/O.Q. 3101262 | 7.2500 | NaN | S | test | Mr | F | 1 | Single | Alone |
| 416 | 1308 | NaN | 3 | Ware, Mr. Frederick | male | NaN | 0 | 0 | 359309 | 8.0500 | NaN | S | test | Mr | E | 1 | Single | Alone |
| 417 | 1309 | NaN | 3 | Peter, Master. Michael J | male | NaN | 1 | 1 | 2668 | 22.3583 | NaN | C | test | Master | G | 3 | Small Group | With Family |